Capturing Errors in Written Chinese Words

نویسندگان

  • Chao-Lin Liu
  • Kan-Wen Tien
  • Min-Hua Lai
  • Yi-Hsuan Chuang
  • Shih-Hung Wu
چکیده

A collection of 3208 reported errors of Chinese words were analyzed. Among which, 7.2% involved rarely used character, and 98.4% were assigned common classifications of their causes by human subjects. In particular, 80% of the errors observed in writings of middle school students were related to the pronunciations and 30% were related to the compositions of words. Experimental results show that using intuitive Web-based statistics helped us capture only about 75% of these errors. In a related task, the Web-based statistics are useful for recommending incorrect characters for composing test items for "incorrect character identification" tests about 93% of the time.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Case Study of Cantonese Acquired Dysgraphia – Data for the Organization of the Orthographic Lexicon

This paper reports a case study of a Chinese-speaking dysgraphic patient, NMY. Among his written errors in written naming and writing-to-dictation were phonologically plausible errors. Since his semantic processing of content words were hypothesized to be largely intact, as evidenced by normal performance on non-verbal semantic tests and word-picture matching, the occurrence of such errors prov...

متن کامل

Phonological and Logographic Influences on Errors in Written Chinese Words

We analyze a collection of 3208 reported errors of Chinese words. Among these errors, 7.2% involved rarely used character, and 98.4% were assigned common classifications of their causes by human subjects. In particular, 80% of the errors observed in the writings of middle school students were related to the pronunciations and 30% were related to the logographs of the words. We conducted experim...

متن کامل

How Chinese Native Speakers Handle Written Style Material in Reading and its Application in Second Language Acquisition

In this paper, a new approach was employed to investigate the relationship between written Chinese and spoken Chinese. In a pilot study, how Chinese native speakers handle written style materials was investigated and compared with Chinese L2 learners. Preliminary findings reveal that both groups are aware of the variation of written levels in different genres. However, the written elements sele...

متن کامل

The Dependence of Frequency Distributions on Multiple Meanings of Words, Codes and Signs

The dependence of the frequency distributions due to multiple meanings of words in a text is investigated by deleting letters. By coding the words with fewer letters the number of meanings per coded word increases. This increase is measured and used as an input in a predictive theory. For a text written in English, the word-frequency distribution is broad and fat-tailed, whereas if the words ar...

متن کامل

تصحیح قیاسی برخی از عبارات دشوار شرح شطحیات

The current article aims at reviewing and correcting some difficult and obscure words in Description of Shathyyāt written by Roozbehān Baqali. Similar to the mystic texts, this book is found to use technical writing style which causes it to be one of the complicated mystic passages. Some complexities of this book, however, are assumed to be originated in errors and inaccuracies of text. A Compa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009